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(57) A method of decoding a speech signal based on a CELP (Code Excited Linear Prediction) with improvement in 
degradation of decoded sound quality in a noise period. The method includes the steps of: calculating a norm of an 
excitation vector for each fixed period in a noise period; smoothing the calculated norm using a norm obtained in a 
previous period; changing the amplitude of the excitation vector in the period using the calculated norm and the 
smoothed norm; and driving a synthesizing filter by the excitation vector with the changed amplitude. 
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Abstract of the Disclosure: 

A method of decoding a speech signal based on a CELP 
(Code Excited Linear Prediction) with improvement in 
degradation of decoded sound quality in a noise period. 
The method includes the steps of: calculating a norm of an 
excitation vector for each fixed period in a noise period; 
smoothing the calculated norm using a norm obtained in a 
previous period; changing the amplitude of the excitation 
vector in the period using the calculated norm and the 
smoothed norm; and driving a synthesizing filter by the 
excitation vector with the changed amplitude. 
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METHOD AND APPARATUS FOR DECODING SPEECH SIGNAL 

BACKGROUND OF THE INVENTION 
1. Field of the Invention: 
5 The present invention relates generally to a coding 

and decoding technique for transmitting speech signals at a 
low bit rate, and more particularly to a decoding method 
and a decoding apparatus for improving sound quality in an 
environment where noise exists. 
10 2. Description of the Prior Art: 

Methods of coding a speech signal by separating the 
speech signal to a linear prediction filter and its driving 
excitation signal (also referred to as excitation signal or 
excitation vector) are widely used as a method of 
15 efficiently coding a speech signal at an intermediate or 
low bit rate. One typical method thereof is CELP (Code 
Excited Linear Prediction). In the CELP, an excitation 
signal (excitation vector) drives a linear prediction 
filter for which a linear prediction coefficient 
20 representing frequency characteristics of input speech is 
set, thereby obtaining a synthesized speech signal 
(reproduced speech, reproduced vector). The excitation 
signal is represented by the sum of a pitch signal (pitch 
vector) representing a pitch period of speech and a sound 
25 source signal (sound source vector) comprising random 

numbers or pulses. In this case, each of the pitch signal 
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and the sound source signal is multiplied by gain (i.e., 
pitch gain and sound source gain) . For the CELP, reference 
can be made to M. Schroeder et al., "Code excited linear 
prediction: High quality speech at very low bit rates", 

5 Proc, of IEEE Int. Conf. on Acoust., Speech and Signal 
processing, pp. 937-940, 1985 (Literature 1). 

Mobile communication systems such as a cellular phone 
system require favorable quality of speech in noisy 
environments typified by the hustle and bustle in downtown 

10 or the inside of a running car. However, speech coding 

techniques based on the CELP have a problem of significant 
deterioration of sound quality for speech on which noise is 
superimposed ,that is, speech with background noise. A 
time period in a speech signal under a noisy environment is 

15 referred to as a noise period. 

For improving the quality of coded speech from the 
speech with background noise, a method of smoothing the 
sound source gain at a decoder has been proposed. In this 
method, the smoothing of the sound source gain causes a 

20 smooth change with time in short time average power of the 
sound source signal multiplied by the sound source gain, 
resulting in a smoothed change with time in short time 
average power of the excitation signal as well. This leads 
to mitigation of significant variations in short time 

25 average power in decoded noise, which is one of factors for 
degradation, thereby improving the sound quality. 
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For a method of smoothing gain in the sound source 
signal, reference can be made, for example, to Section 6-1 
of "Digital Cellular Telecommunication System; Adaptive 
Multi-Rate Speech Transcoding", ETSI Technical Report, GSM 

5 06.90, version 2.0.0 (Literature 2). 

Fig. 1 is a block diagram showing an example of a 
configuration of a conventional speech signal decoding 
apparatus, and illustrates a technique of improving quality 
of coding of a speech with background noise by smoothing 

10 gain in a sound source signal. Assume herein that bit 
sequences are inputted at a frame period of Tf r (for 
example, 20 milliseconds), and reproduced vectors are 
calculated at a subframe period of (Tf r /N s f r ) (for example, 
5 milliseconds) where N s f r is an integer number (for 

15 example, 4). A frame length is Lf r samples (for example, 
320 samples), and a subframe length is L s f r samples (for 
example, 80 samples). These numbers of samples are 
employed in the case of a sampling frequency of 16 kHz for 
input signals. Description is hereinafter made for the 

20 speech signal decoding apparatus shown in Fig. 1. 

Bit sequences of coded data are supplied from input 
terminal 10. Code input circuit 1010 divides and converts 
the bit sequences supplied from input terminal 10 to 
indexes corresponding to a plurality of decoding parameters. 

25 Code input circuit 1010 provides an index corresponding to 
an LSP (Line Spectrum Pair) representing the frequency 
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characteristic of the input signal to LSP decoding circuit 
1020/ an index corresponding to delay representing the 
pitch period of the input signal to pitch signal decoding 
circuit 1210, an index corresponding to a sound source 
5 vector including random numbers or pulses to sound source 
signal decoding circuit 1110, an index corresponding to a 
first gain to first gain decoding circuit 1220, and an 
index corresponding to a second gain to second gain 
decoding circuit 1120. 
10 LSP decoding circuit 1020 contains a table in which 

plural sets of LSPs are stored. LSP decoding circuit 1020 
receives, as its input, the index outputted from code input 
circuit 1010, reads the LSP corresponding to that index 
from the table contained therein, and sets the read LSP to 
15 LSP: qj Nsfr) (n), j = l,...,N p in N s f r th subframe of the current 
frame (n-th frame), where N p represents a linear prediction 
order. The LSPs from the first to (N s f r -l)th subframes are 
derived by linear interpolation of qj Nsfr) (n) and q j Nsfr) (n - 1) . 
LSP decoding circuit 1020 outputs the LSP: q^ m) (n) , 
20 j=lf-.*/N p , m-l,...,N s f r to linear prediction coefficient 
converting circuit 1030 and to smoothing coefficient 
calculating circuit 1310. 

Linear prediction coefficient converting circuit 1030 
converts the LSP: q^ m) (n) supplied from LSP decoding circuit 
25 1020 to linear prediction coefficient d^(n), j = l, ...,N p , 
m=l , . . . ,N s f r , and outputs it to synthesizing filter 1040. 
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It should be noted that, for the conversion from the LSP to 
the linear prediction coefficient, known methods can be 
used, for example the method described in Section 5.2.4 of 

Literature 2 . 

5 sound source signal decoding circuit 1110 contains a 

table in which a plurality of sound source vectors are 
stored. Sound source signal decoding circuit 1110 receives 
the index outputted from code input circuit 1010, reads the 
sound source vector corresponding to that index from the 

10 table contained therein, and outputs it to second gain 
circuit 1130. 

First gain decoding circuit 1220 includes a table in 
which a plurality of gains are stored. First gain decoding 
circuit 1220 receives, as its input, the index outputted 
15 from code input circuit 1010, reads the first gain 
corresponding to that index from the table contained 
therein, and outputs it to first gain circuit 1230. 

Second gain decoding circuit 1120 contains another 
table in which a plurality of gains are stored. Second 
20 gain decoding circuit 1120 receives, as its input, the 

index from code input circuit 1010, reads the second gain 
corresponding to that index from the table contained 
therein, and outputs it to smoothing circuit 1320. 

First gain circuit 1230 receives, as its inputs, a 
25 first pitch vector, later described, outputted from pitch 
signal decoding circuit 1210 and the first gain outputted 
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from first gain decoding circuit 1220, multiplies the first 
pitch vector by the first gain to produce a second pitch 
vector, and outputs the produced second pitch vector to 
adder 1050. 

5 Second gain circuit 1130 receives, as its inputs, the 

first sound source vector from sound source signal decoding 
circuit 1110 and the second gain, later described, from 
smoothing circuit 1320, multiplies the first sound source 
vector by the second gain to produce a second sound source 

10 vector, and outputs the produced second sound source vector 
to adder 1050. 

Adder 1050 calculates the sum of the second pitch 
vector from first gain circuit 1230 and the second sound 
source vector from second gain circuit 1130 and outputs the 

15 result of the addition to synthesizing filter 1040 as an 
excitation vector. 

Storage circuit 1240 receives the excitation vector 
from adder 1050 and holds it. Storage circuit 1240 outputs 
the excitation vectors which were previously received and 

20 held thereby to pitch signal decoding circuit 1210. 

Pitch signal decoding circuit 1210 receives, as its 
inputs, the previous excitation vectors held in storage 
circuit 1240 and the index from code input circuit 1010. 
The index specifies a delay L p( j. Pitch signal decoding 

25 circuit 1210 takes a vector for L S f r samples corresponding 
to a vector length from the point going back Lpd samples 
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from the beginning of the current frame in the previous 
excitation vectors to produce a first pitch signal (i.e., 
first pitch vector) . When L pd < L s f r , a vector for L pd 
samples is taken, and the taken L pd samples are repeatedly 
connected to produce a first pitch vector with a vector 
length of L sfr samples. Pitch signal decoding circuit 1210 
outputs the first pitch vector to first gain circuit 1230. 

Smoothing coefficient calculating circuit 1310 
receives the LSP: qf°(n) outputted from LSP decoding 
circuit 1020, and calculates an average LSP: q 0 j(n) in n-th 
frame with the following equation: 

q oj (n) = 0.84 • q oj (n - 1) + 0.16 • qf rfr) (n) 

Next, smoothing coefficient calculating circuit 1310 
calculates a variation d 0 (m) of the LSP for each subframe m 
15 with the following equation: 



10 



20 



, ^ ^ lq 0j (n)-qf>(n)| 



25 



>i qoj( n ) 

A smoothing coefficient k 0 (m) in subframe m is calculated 
with the following equation: 

k 0 (m) = min(0.25, max(0, d 0 (m)-0 .4) )/0 .25 
where min(x,y) is a function which takes on a smaller one 
of x and y, while max(x,y) is a function which takes on a 
larger one of x and y. Finally, smoothing coefficient 
calculating circuit 1310 outputs the smoothing coefficient 
ko(m) to smoothing circuit 1320. 

Smoothing circuit 1320 receives, as its inputs, the 
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15 



20 



smoothing coefficient k 0 (m) from smoothing coefficient 
calculating circuit 1310 and the second gain from second 
gain decoding circuit 1120. Smoothing circuit 1320 
calculates an average gain g 0 (m) from a second gain g 0 (m) 
in a subframe m with the following equation: 
1 4 

go( m ) = 7Z9o( m - i ) 

3 i=0 

Next, the following equation is substituted for the 
second gain: 

g 0 (*0 = 3o(nO • k 0 (n>) + g 0 («») • C 1 - k o( m » 

Finally, smoothing circuit 1320 outputs the 
substituted second gain to second gain circuit 1130. 

Synthesizing filter 1040 receives, as its inputs, the 
excitation vector from adder 1050 and the linear prediction 

coefficient af\n) , J-l V * =1 N ^ fr ° m Unear 

prediction coefficient converting circuit 1030. In 
synthesizing filter 1040, the excitation vector drives the 
synthesizing filter (1/A(e)) for which the linear 
prediction coefficient is set to calculates a reproduced 
vector which is then outputted from output terminal 20. 

The transfer function of synthesizing filter 1040 is 
represented as follows: 
1 1 



l-£ a i z 

where the linear prediction coefficient is a it i=l,...,N p . 
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Next, a conventional speech signal coding apparatus 
is described. Fig. 2 is a block diagram showing an example 
of a configuration of a speech signal coding apparatus used 
in a conventional speech signal coding and decoding system. 
5 The speech signal coding apparatus is used in a pair with 
the speech signal decoding apparatus shown in Fig. 1 such 
that coded data outputted from the speech signal coding 
apparatus is transmitted and inputted to the speech signal 
decoding apparatus shown in Fig. 1. Since the operations 
10 of first gain circuit 1230, second gain circuit 1130, adder 
1050 and storage circuit 1240 in Fig. 2 are similar to 
those of the respective corresponding functional blocks 
described for the speech signal decoding apparatus shown in 
Fig. 1, the description thereof is not repeated here. 
15 in the apparatus shown in Fig. 2, speech signals are 

sampled, and a plurality of the resultant samples are 
formed into one vector as one frame to produce an input 
signal (input vector) which is then inputted from input 
terminal 30. 

20 Linear prediction coefficient calculating circuit 

5510 performs linear prediction analysis on the input 
vector supplied from input terminal 30 to derive a linear 
prediction coefficient. For the linear prediction analysis, 
reference can be made to known methods, for example, in 

25 Section 8 "Linear Predictive Coding of Speech" of "Digital 
Processing of Speech Signals", L. R. Rabiner et al., 
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Prentice-Hall, 1978 (Literature 3). Linear prediction 
coefficient calculating circuit 5510 outputs the derived 
linear prediction coefficient to LSP 
conversion/quantization circuit 5520. 
5 LSP conversion/quantization circuit 5520 receives the 

linear prediction coefficient from linear prediction 
coefficient calculating circuit 5510, converts the linear 
prediction coefficient to an LSP, quantizes the LSP to 
derive the quantized LSP. For the conversion from the 

10 linear prediction coefficient to the LSP, known methods can 
be referenced, for example, the method described in Section 
5.2.4 of Literature 2. For the quantization of the LSP, 
the method described in Section 5.2.5 of Literature 2 can 
be referenced. The quantized LSP is set to a quantized 

15 LSP: qj Nsfr) (n) , j=l,...,N p in N sfr th subframe of the current 
frame (n-th frame), similarly to the LSP in the LSP 
decoding circuit of the speech signal decoding apparatus 
shown in Fig. 1. The quantized LSPs from the first to 
( N sfr-l)th subframes are derived by linear interpolation of 

20 q5 Nsfr) (n) and qf sfr) (n - 1) . The LSP is set to an LSP in a 

(Nsfr- 1 ) 1 ^ subframe of the current frame (n-th frame). The 
LSPs from the first to (N s f r -l)th subframes are derived by 
linear interpolation of qj Nsfr) (n) and qj Nsfr) (n - 1) . 

LSP conversion/quantization circuit 5520 outputs the 

25 LSP: q^ m) (n) , j=l,...,N p , m=l,...,N s f r and the quantized 

LSP: q^ m) (n) , j = l,...,N p , m=l,...,N S f r to linear prediction 
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coefficient converting circuit 5030, and outputs the index 
corresponding to the quantized LSP: qj Nsfr) (n) to code output 
circuit 6010. 

Linear prediction coefficient converting circuit 5030 
receives, as its inputs, the LSP: q^ m) (n) and the quantized 
LSP: q^ m) (n) from LSP conversion/quantization circuit 5520, 
converts the LSP ( q!| m) (n) ) to a linear prediction 
coefficient [<x$ m) (n), j=l,...,N p , m=l, . . . ,N sfr ] , converts 
the quantized LSP ( q^ m) (n) ) to a quantized linear 
prediction coefficient: 6^ m) (n), j = l,-..,N p , m=l , . . - ,N S f r , 
outputs the linear prediction coefficient a^ m) (n) to 
weighting filter 5050 and to weighting synthesizing filter 
5040, and outputs the quantized linear prediction 
coefficient d^ m) (n) to weighting synthesizing filter 5040. 
For the conversion from the LSP to the linear prediction 
coefficient and the conversion from the quantized LSP to 
the quantized linear prediction coefficient, known methods 
can be referenced, for example, the method described in 
Section 5.2.4 of Literature 2. 

Weighting filter 5050 receives, as its inputs, the 
input vector from input terminal 30 and the linear 
prediction coefficient txjj m) (n) from linear prediction 
coefficient converting circuit 5030, uses the linear 
prediction coefficient to produce a transfer function W(z) 
of the weighting filter corresponding to human auditory 
characteristics. The weighting filter is driven by the 
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input vector to obtain a weighted input vector. Weighting 
filter 5050 outputs the weighted input vector to 
differentiator 5070. The transfer function W(z) of the 
weighting filter is represented as follows: 

W(z) = Q(z/Yi)/Q(z/Y 2 ) 
Here, the followings hold: 



N p 



QWYiH-Zaiftf* 1 



i=l 
N p 



10 



Yl and Y 2 are constants, for example, Yl =0S and y 2 =0.6. 
For details on the weighting filter. Literature 1 can be 
referenced . 

Weighting synthesizing filter 5040 receives, as its 
inputs, an excitation vector outputted from adder 1050, the 
linear prediction coefficient af>(n) , and the quantized 
15 linear prediction coefficient of >(n) outputted from linear 
prediction coefficient converting circuit 5030. The 
weighting synthesizing filter H(z)W(z) = 

Q(z/y,)/IA(z)Q(*/Y 2 )] ^ which those are set is driven by 
the excitation vector to obtain a weighted reproduced 
20 vector. The transfer function H(z)=l/A(z) of the 
synthesizing filter is represented as follows: 

1 1 



. i=l 
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Differentiator 5060 receives, as its inputs, the 
weighted input vector from weighting filter 5050 and the 
weighted reproduced vector from weighting synthesizing 
filter 5040, and calculates and outputs the difference 
between them as a difference vector to minimization circuit 

5070. 

Minimization circuit 5070 sequentially outputs 
indexes corresponding to all sound source vectors stored in 
sound source signal producing circuit 5110 to sound source 
signal producing circuit 5110, indexes corresponding to all 
delays L pd within a specified range in pitch signal 
producing circuit 5210 to pitch signal producing circuit 
5210, indexes corresponding to all first gains stored in 
first gain producing circuit 6220 to first gain producing 
circuit 6220, and indexes corresponding to all second gains 
stored in second gain producing circuit 6120 to second gain 
producing circuit 6120. Minimization circuit 5070 also 
calculates the norm of the difference vector outputted from 
differentiator 5060, selects the sound source vector, delay, 
first gain and second gain which lead to a minimized norm, 
and outputs the indexes corresponding to the selected 
values to code output circuit 6010. 

Each of pitch signal producing circuit 5210, sound 
source signal producing circuit 5110, first gain producing 
circuit 6220 and second gain producing circuit 6120 
sequentially receives the indexes outputted from 
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minimization circuit 5070. Since each of these pitch 
signal producing circuit 5210, sound source signal 
producing circuit 5110, first gain producing circuit 6220 
and second gain producing circuit 6120 is the same as the 
5 counterpart of pitch signal decoding circuit 1210, sound 
source signal decoding circuit 1110, first gain decoding 
circuit 1220 and second gain decoding circuit 1120 shown in 
Fig. 1 except the connections for input and output, the 
detailed description of each of these blocks is not 

10 repeated. 

Code output circuit 6010 receives the index 
corresponding to the quantized LSP outputted from LSP 
conversion/quantization circuit 5520, receives the indexes 
each corresponding to the sound source vector, delay, first 

15 gain and second gain outputted from minimization circuit 
5070, converts each of the indexes to a code of bit 
sequences, and outputs it through output terminal 40. 

The aforementioned conventional decoding apparatus 
and coding and decoding system have a problem of 

20 insufficient improvement in degradation of decoded sound 
quality in a noise period since the smoothing of the sound 
source gain (second gain) in the noise period fails to 
cause a sufficiently smooth change with time in short time 
average power calculated from the excitation vector. This 

25 is because the smoothing only of the sound source gain does 
not necessarily sufficiently smooth the short time average 
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power of the excitation vector which is derived by adding 
the sound source vector (the second sound source vector 
after the gain multiplication) to a pitch vector (the 
second pitch vector after the gain multiplication) . 
5 Fig. 3 shows short time average power of an 

excitation signal (excitation vector) when sound source 
gain smoothing is performed in a noise period on the basis 
of the aforementioned prior art. Fig. 4 shows short time 
average power of an excitation signal when such smoothing 
10 is not performed. In each of these graphs, the horizontal 
axis represent a frame number, while the vertical axis 
represents power. The short time average power is 
calculated every 80 msec. It can be seen from Fig. 3 and 
Fig. 4 that, when the sound source gain is smoothed 
15 according to the prior art, the short time average power in 
the excitation signal after the smoothing is not 
necessarily smoothed sufficiently in terms of time. 

SUMMARY OF THE INVENTION 
It is an object of the present invention to provide a 
20 decoding method and a coding and decoding method with 

improved degradation of decoded sound quality in a noise 
period. 

It is another object of the present invention to 
provide a decoding apparatus and a coding and decoding 
25 system with improved degradation of decoded sound quality 
in a noise period. 



15 



CA 02317969 2000-09-08 



L 0 ♦ ♦ 

The first object of the present invention is achieved 
by a method of decoding a speech signal by decoding 
information on an excitation signal and information on a 
linear prediction coefficient from a received signal, 
producing the excitation signal and the linear prediction 
coefficient from the decoded information, and driving a 
filter configured with the linear prediction coefficient by 
the excitation signal, the method comprising the steps of: 
calculating a norm of the excitation signal for each fixed 
period; smoothing the calculated norm using a norm obtained 
in a previous period; changing the amplitude of the 
excitation signal in the period using the calculated norm 
and the smoothed norm; and driving the filter by the 
excitation signal with the changed amplitude. 

The second object of the present invention is 
achieved by an apparatus for decoding a speech signal by 
decoding information on an excitation signal and 
information on a linear prediction coefficient from a 
received signal, producing the excitation signal and the 
linear prediction coefficient from the decoded information, 
and driving a filter configured with the linear prediction 
coefficient by the excitation signal, the apparatus 
comprising: an excitation signal normalizing circuit for 
calculating a norm of the excitation signal for each fixed 
period and dividing the excitation signal by the norm; a 
smoothing circuit for smoothing the norm using a norm 
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obtained in a previous period; and an excitation signal 
restoring circuit for multiplying the excitation signal by 
the smoothed norm to change the amplitude of the excitation 

signal in the period. 
5 in the present invention, the excitation signal is 

typically an excitation vector. 

In the present invention, since smoothing is 
performed in a noise period on the norm calculated from the 
excitation vector obtained by adding a sound source vector 
10 (a second sound source vector after gain multiplication) to 
a pitch vector (a second pitch vector after gain 
multiplication) , short time average power is smoothed in 
terms of time in the excitation vector. Therefore, 
improvement can be obtained in degradation of decoded sound 
15 quality in a noise period. 

In the present invention, the smoothing may be 
performed on the norm derived from the excitation vector by 
selectively using a plurality of processing methods 
provided in consideration of the characteristic of an input 
20 signal, not by using single processing. The provided 
processing methods include, for example, moving average 
processing which performs calculations from decoding 
parameters in a limited previous period, auto-regressive 
processing which can consider the effect of a long past 
25 period, or non-linear processing which limits a preset 

value with upper and lower limits after calculation of an 
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average . 

The above and other objects, features, and advantages 
of the present invention will be apparent from the 
following description referring to the accompanying 
5 drawings which illustrate an example of a preferred 
embodiment of the present invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Fig. 1 is a block diagram showing an example of a 
configuration of a conventional speech signal decoding 

10 apparatus ; 

Fig. 2 is a block diagram showing an example of a 
configuration of a conventional speech signal coding 
apparatus; 

Fig. 3 is a graph representing short time average 
15 power of an excitation signal (excitation vector) for which 
smoothing of sound source gain was performed on the basis 
of a conventional method; 

Fig. 4 is a graph representing short time average 
power of an excitation signal (excitation vector) for which 
20 smoothing was not performed; 

Fig. 5 is a block diagram showing a configuration of 
a speech signal decoding apparatus based on a first 
embodiment of the present invention; 

Fig. 6 is a graph representing short time average 
25 power of an excitation signal (excitation vector) for which 
smoothing was performed on a norm calculated from an 
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15 



20 



25 



excitation vector based on the present invention; 

Fig. 7 is a block diagram showing a configuration of 
a speech signal decoding apparatus based on a second 
embodiment of the present invention; 

Fig. 8 is a block diagram showing a configuration of 
a speech signal decoding apparatus based on a third 
embodiment of the present invention; and 

Fig. 9 is a block diagram showing a configuration of 
a speech signal decoding apparatus based on a fourth 
embodiment of the present invention. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 
A speech signal decoding apparatus of a first 
embodiment of the present invention shown in Fig. 5 forms a 
pair with the conventional speech signal coding apparatus 
shown in Fig. 2 to constitute a speech signal coding and 
decoding system, and is configured to receive, as its input, 
coded data outputted from the speech signal coding 
apparatus shown in Fig. 2 to perform decoding of the coded 
data. 

The speech signal decoding apparatus shown in Fig. 5 
differs from the conventional speech signal decoding 
apparatus shown in Fig. 1 in that excitation signal 
normalizing circuit 2510 and excitation signal restoring 
circuit 2610 are added and the connections are changed in 
the vicinity of them including adder 1050 and smoothing 
circuit 1320. Specifically, the output from adder 1050 is 
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supplied only to excitation signal normalizing circuit 2510, 
and the output from second gain decoding circuit 1120 is 
directly supplied to second gain circuit 1130, the gain 
from excitation signal normalizing circuit 2510 is supplied 
to smoothing circuit 1320 instead of the output from second 
gain decoding circuit 1120, the shape vector from 
excitation signal normalizing circuit 2510 and the output 
from smoothing circuit 1320 are supplied to excitation 
signal restoring circuit 2610, and the output from 
excitation signal restoring circuit 2610 is supplied to 
synthesizing filter 1040 and to storage circuit 1240 
instead of the output from adder 1050. 

Excitation signal normalizing circuit 2510 calculates 
a norm of the excitation vector outputted from adder 1050 
for each fixed period, and divides the excitation vector by 
the calculated norm. In this speech signal decoding 
apparatus, smoothing circuit 1320 smoothes a norm with a 
norm obtained in a previous period. Excitation signal 
restoring circuit 2610 multiplies the excitation vector by 
the smoothed norm to change the amplitude of the excitation 
vector in that period. 

In Fig. 5, the functional blocks identical to those 
in Fig. 1 are designated the same reference numerals as 
those in Fig. 1. Specifically, since input terminal 10, 
output terminal 20, code input circuit 1010, LSP decoding 
circuit 1020, linear prediction coefficient converting 
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circuit 1030, sound source signal decoding circuit 1110, 
storage circuit 1240, pitch signal decoding circuit 1210, 
first gain decoding circuit 1220, second gain decoding 1120, 
first gain circuit 1230, second gain circuit 1130, adder 
5 1050, smoothing coefficient calculating circuit 1310 and 
synthesizing filter 1040 in Fig. 5 are the same as the 
counterparts in Fig. 1, the description thereof is not 
repeated here. Description is hereinafter made for 
excitation signal normalizing circuit 2510 and excitation 
10 signal restoring circuit 2610. 

Assume herein, similarly to the case shown in Fig. 1, 
that bit sequences are inputted at a frame period of T fr 
(for example, 20 msec), and reproduced vectors are 
calculated at a period (subframe) of T fr /N s£r (for example, 
15 5 msec) where N sfr is an integer number (for example, 4). 

A frame length corresponds to L £r samples (for example, 320 
samples), and a subframe length corresponds to L sfr samples 
(for example, 80 samples). These numbers are employed in 
the case of a sampling frequency of 16 kHz for input 
20 signals . 

Excitation signal normalizing circuit 2510 receives, 
as its input, an excitation vector [x ( £(i)r i=0, . . . ,L sfr -l, 
m=0,...,N sfr -l] in m-th subframe from adder 1050, 
calculates gain and a shape vector from the excitation 
25 vector [xgj.(i)] for each subframe or for each subsubframe 
obtained by dividing a subframe, outputs the calculated 
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gain to smoothing circuit 1320 and the shape vector to 
excitation signal restoring circuit 2610. As the gain, 
such a norm as represented with the following equation is 

used: 



9exc( mN ssfr 



, 2 

sfr 



- + n 

n=0 v J ssfr 

m = 0, . • . ,N s f r -l, 2=0, . . . ,N 8S f r -l 



where N ssfr is the number of division of a subframe (the 
number of subsubframes in a subframe) (for example, two). 
At this point, excitation signal normalizing circuit 2510 
calculates the shape vector obtained by dividing the 
excitation vector [x^i)) by the gain [g e xc(j)r j-0 f ..., 
(N sfr N ssfr -l ) ] with the following equation: 

s (tnN ssrf+ i) (i) = 1 x «jfi -ilM^ + il 

gexc(*-N S sfr+i) V N ssfr J 

i = 0,...,L sfr /N BS f r -l, 2 = 0,...,N S8 f r -l, 
m = 0, . . • »N s fr _1 

Excitation signal restoring circuit 2610 receives, as 
its input, the gain [g e xc(j)» 3=0, • • • , ( N sfr N ssfr -1 ) ] from 
smoothing circuit 1320 and the shape vector [e^i), 
i=0,...,(L 8fr /N 8sfr -l), j=0,...,(N sfr -N 88fr -l)] from 
excitation signal normalizing circuit 2510, calculates a 
smoothed excitation vector with the following equation, and 
outputs the excitation vector to storage circuit 1240 and 
to synthesizing filter 1040: 
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4»)f J .iUII. + i N )=, olc (m.> st£c+ 2).sS c tl ^ J >(i) 
V N ssfr ) 

i = 0,...,L 8fr /N ssfr -l, 1 = 0 , . . . ,N ssf r -l , 
m = 0, . . . /N s f r -1 

in the speech signal decoding apparatus shown in Fig. 
5 5, adder 1050 adds a sound source vector after it is 

multiplied by gain to a pitch vector after it is multiplied 
by gain to produce an excitation vector. Excitation signal 
normalizing circuit 2510, smoothing circuit 1320 and 
excitation signal restoring circuit 2610 smooth the norm 
10 calculated from the excitation vector in a noise period. 
As a result, short time average power in the excitation 
vector is smoothed in terms of time to improve degradation 
of decoded sound quality in the noise period. 

Fig. 6 shows short time average power of an 
excitation vector after smoothing for the norm calculated 
from the excitation vector in a noise period. The 
horizontal axis represents a frame number, while the 
vertical axis represents power. The short time average 
power is calculated for every 80 msec. It can be seen from 
20 Fig. 6 that the smoothing according to the embodiment 

causes smoothed short time average power in the excitation 
vector (excitation signal) in terms of time. 

Fig. 7 shows a speech signal decoding apparatus of a 
second embodiment of the present invention. The speech 
25 signal decoding apparatus shown in Fig. 7 differs from the 



15 
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speech signal decoding circuit shown in Fig. 5 in that 
4ir st switching circuit 2U0 ana £ irst to third alters 
21 50. 2X60 and 2170 are provided instead of smoothing 
circuit 1320 for performing processing in accordance with 
, the characteristic of an input signal, nothing 

coefficient calculating circuit 1310 is eliminated, and 
sound present/absent discriminating circuit 2020 rs 
provided for discriminating between a sound present period 
end a sound absent period, noise classifying circuit 2030 
10 is provided for classifying noise, power calculating 
circuit 3040 is provided for calculating power of a 
produced vector, and speech .ode determining circuit 3050 
ls provided for determining a speech mode SW,, later 

» functions as a smoothing circuit, but the contents of 

th eir smoothing processing performed are different from one 

another . 

The speech signa! decoding apparatus shown in Fig. 
also forms a pair with the conventional art speech signal 
» coding apparatus shown in rig. 2 to constitute a speech 
signal coding and decoding system, and is configured to 
r eceive coded data outputted from the speech signal coding 
apparatus shown in Fig. 2 to perform decoding of the coded 
data, in Fig. 7, the functional blocks identical to those 
26 in Fig. 5 are designated the same reference numerals as 
those in Fig. 5. 
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Description is hereinafter made for power calculating 
circuit 3040, speech mode determining circuit 3050, sound 
present/absent discriminating circuit 2020, noise 
classifying circuit 2030, first switching circuit 2110, 
5 first filter 2150, second filter 2160 and third filter 2170. 
Power calculating circuit 3040 is supplied with a 
reproduced vector from synthesizing filter 1040, calculates 
power from sum of squares of the reproduced vectors, 
outputs the calculation result to sound present/absent 
10 discriminating circuit 2020. Assume herein that power is 
calculated for each subframe, and power in m-th subframe is 
calculated using a reproduced vector output ted from 
synthesizing filter 1040 in (m-l)th subframe. Assuming 
that the reproduced vector is [S syn (i)/ i=0 , • • • ,L sfr ] , power 
15 (Epow) is calculated with the following equation: 



1 Lsfr_1 2 
Epow = - Z SsynCO 



L sfr i=0 



Instead of the above equation, for example, a norm 
for a reproduced vector represented by the following 
equation may be used: 



lLsfr-1 2 

20 E pow = X SsynOO 

V i=0 



Speech mode determining circuit 3050 is supplied with 
a previous excitation vector [e me m(i) » i=0 » • • • ' (Lmem-l) 1 
held in storage circuit 1240 and with an index from code 
input circuit 1010. This index specifies a delay L pd . The 
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10 



Lmem is a constant determined by the maximum value of the 
L pd . In m-th subframe, speech mode determining circuit 
3050 calculates a pitch prediction gain [G em em( m ) > 
m=l, . • . ,N s£r ] as follows, from the previous excitation 
vector e mem (i) and the delay L pd : 

Gemem(m) = 10 lo 9l0 (9emem(m) ) 

t \ 1 
where q esaBS[ {^)= ZJ7~{ 

E c (m) 



15 



20 



E al(m) E a2(m) 

Lsfr-1 2 

E al( m ) = H e mem( i ) 
i=0 

Lsfr" 1 

E a2( m )= Z e mem( i - L pd) 

i=0 

Lsfr-1 

E c( m )= Z e mem( i ) e mem( i_L pd) 
i=0 

Speech mode determining circuit 3050 performs the following 
threshold value processing on the pitch prediction gain 
G emem (m), or an in-frame average value G^n) in n-th 
frame for the G emem {m) , thereby setting a speech mode S mo d e : 

if (Gemem(n)* 3 - 5 ) then S mode =2 

else S mode =0 

Speech mode determining circuit 3050 outputs the speech 
mode S m ode to sound present/absent discriminating circuit 
2020. 

Sound present/absent discriminating circuit 2020 
receives, as its inputs, the LSP: q$ m) (n) outputted from LSP 
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decoding circuit 1020, the speech mode S mod e outputted from 
speech mode determining circuit 2050, and the power 
outputted from power calculating circuit 3040. The 
procedure for deriving the amount of variations in spectrum 
5 parameter in sound present/absent discriminating circuit 
2020 is given below. The LSP:qJ m) (n) is used herein as the 
spectrum parameter. In n-th frame, a long time average 
qj(n) of the LSP is calculated with the following equation: 

qj(n) = Po " qj(n - + 0 " Po)<l ( j Nsfr) (n) 
10 j = If • • • 

where p 0 =0.9. A variation amount d q (n) of the LSP in n-th 
frame is defined with the following equation: 

H j=lm=l 9j( n ) 
where Dj}(n) corresponds to the distance between q~j(n) and 
15 qf°(n). For example, one of the following equations may be 
used: 

DS(n)=(qj(»)-4P( n )) 2 



or 



20 The latter is used in this case. Generally, a period with 
a large variation amount d q (n) corresponds to a sound 
present period, while a period with a small variation 
amount d q (n) corresponds to a sound absent period (noise 
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period) . However, there is a problem that a threshold 
value for discriminating between the sound present period 
and sound absent period is not easily set since the 
variation amount exerts large variations with time and the 

5 range of values of variation amounts in the sound present 
period overlaps with the range of values of variation 
amounts in the sound absent period. Thus, the long time 
average of the variation amount d q (n) is used for 
discrimination between the sound present period and sound 

10 absent period. A long time average d ql (n) is derived using 
a linear filter or a non-linear filter. The average value, 
median value, mode of the variation amount d q (n) or the 
like can be applied thereto, for example. In this case, 
the following equation is used: 

15 d ql (n) = P r d ql (n-l) + (l-P,)-d q (n) 

where pj =05 . 

With threshold processing for the average value, a 
discrimination flag S vs is determined as follows: 
if (d ql (n)>C thI ) then S vs =l 

20 else S vs =0 

where C t hi is a constant (for example, 2.2), and S V s = 1 
corresponds to a sound present period, while S v s = 0 
corresponds to a sound absent period. Since a period with 
high constancy has a small S V s even in the sound present 

25 period, it may be erroneously considered as a sound absent 
period. Thus, when a frame has large power and pitch 
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prediction gain is large in a period, the period should be 
considered as a sound present period. At this point, the 
S vs is modified by the following additional determination: 

if (Enns^CnM and S mode >2) then S vs =l 

else S vs = 0 

where C rms is a certain constant (for example, 10000). 
S mo<Je £2 corresponds to the in-frame average value G op (n) of 
the pitch prediction gain equal to or higher than 3.5 dB. 
Sound present/absent discriminating circuit 2020 outputs 
the discrimination flag S vs to noise classifying circuit 
2030 and to first switching circuit 2110, and outputs 
d q i(n) to noise classifying circuit 2030. 

Noise classifying circuit 2030 receives, as its input, 
d ql (n) and the discrimination flag S vs outputted from sound 
present/absent discriminating circuit 2020. In a sound 
absent period (noise period), a linear filter or a non- 
linear filter is used to derive a value d q2 (n) which 
reflects average behaviors of d ql (n) . When the S vs = 0, 
the following equation is calculated: 

d q2 (n) = p 2 d q2 (n-l) + (l-P 2 )d ql (n) 

where p 2 = 0.94 . 

With threshold processing for d q2 (n), noise is 
classified, and a classification flag S vs is determined as 
follows : 

if (d^Cn^Cth;) then S nz — 1 

else S n7 = 0 
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where C th2 is a certain constant (for example, 1.7), and 

S n2 = 1 corresponds to noise having a frequency 

characteristic inconstantly changing with time, while S n2 = 

0 corresponds to noise having a frequency characteristic 

6 constantly changing with time. Noise classifying circuit 

2030 outputs the S nz to first switching circuit 2110. 

First switching circuit 2110 receives, as its inputs, 

j_ n i n , -N D ^-1)] outputted from 

the gaxn [gexcO)' ]-"#•■•» l w ssfr w sfr ' 

excitation signal normalizing circuit 2510, the 

10 discrimination flag S vs from sound present/absent 

discriminating circuit 2020, and the classification flag 

Snz from noise classifying circuit 2030. First switching 

circuit 2110 switches a switch in accordance with the value 

of the discrimination flag and the value of the 

15 classification flag, thereby outputting the gain g ex c(j) to 

first filter 2150 if S vs = S nz = 0, to second filter 2160 

if S vs = 0 and S« = 1, or to third filter 2170 if S vs = 1. 

First filter 2150 receives, as its input, the gain 

... .,_ n / N , -n fr -l)1 from first switching 

[9exc(j)/ j-Of • • • i \ "ssfr n sfr 1 J ■> 

20 circuit 2110, smoothes it with a linear filter or a non- 
linear filter to produce a first smoothed gain gexc,i 

(j), and 

outputs it to excitation signal restoring circuit 2610. In 
this case, the filter represented by the following equation 
is used: 

25 g exc , 1 (n) = Y 2 igexc,l(n-l) + (l-Y2l)-9exc(n) 

where fexc.iC- 1 ) corresponds to g e xc,l( N ssfr N sfr ~0 in tne 
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previous frame. Also, y 2 i=0.94. 

Second filter 2160 smoothes the gain outputted from 
first switching circuit 2110 using a linear filter or a 
non-linear filter to produce a second smoothed gain 
5 9exc,2(j) which is then outputted to excitation signal 
restoring circuit 2160. In this case, the filter 
represented by the following equation is used: 

9exc,2 (" ) = Y 22 ' 9exq,2 " 1) + 0 ~ Y 22 ) * 9 exc (* ) 
where q ma (-l) corresponds to g exCl 2( N ssfr " N sfr "0 in the 
10 previous frame. Also, y 22 =0.9. 

Third filter 2170 receives, as its input, the gain 
outputted from first switching circuit 2110, smoothes it 
with a linear filter or a non-linear filter to produce a 
third smoothed gain g exc 3 (n) ' anc * outputs it to excitation 
15 signal restoring circuit 2160. In this case, 

9exc,3( n ) = 9exc( n ) • 

As described above, in the speech signal decoding 

apparatus shown in Fig. 7, first filter 2150, second filter 

2160 and third filter 2170 can perform different smoothing 

20 processing, and power calculating circuit 3040, speech mode 
determining circuit 3050, sound present/sound absent 
discriminating circuit 2020 and noise classifying circuit 
2030 can identify the nature of an input signal. The 
switching of the filters in accordance with the identified 

25 nature of the input signal enables smoothing processing of 
the excitation signal to be performed in consideration of 
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the characteristics of the input signal. As a result, 
optimal processing is selected according to background 
noise to allow further improvement in degradation of 
decoded sound quality in a noise period. 
5 Fig. 8 shows a speech signal decoding apparatus of a 

third embodiment of the present invention. The speech 
signal decoding apparatus shown in Fig. 8 differs from the 
speech signal decoding apparatus shown in Fig. 5 in that 
input terminal 50 and second switching circuit 7110 are 

10 added and the connections are changed. The speech signal 
decoding apparatus shown in Fig. 8 also forms a pair with 
the conventional speech signal coding apparatus shown in 
Fig. 2 to constitute a speech signal coding and decoding 
system, and is configured to receive coded data outputted 

15 from the speech signal coding apparatus shown in Fig. 2 to 
perform decoding the coded data. In Fig. 8, the functional 
blocks identical to those in Fig. 5 are designated the same 
reference numerals as those in Fig. 5. 

A switching control signal is supplied from input 

20 terminal 50. Second switching circuit 7110 receives an 
excitation vector outputted from adder 1050, and outputs 
the excitation vector to synthesizing filter 1040 or to 
excitation signal normalizing circuit 2510 in accordance 
with the switching control signal. Therefore, the speech 

25 signal decoding apparatus can select whether the amplitude 
of the excitation vector is changed or not in accordance 
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with the switching control signal. 

Fig. 9 shows a speech signal decoding apparatus of a 
fourth embodiment of the present invention. The speech 
signal decoding apparatus differs from the speech signal 
decoding apparatus shown in Fig. 7 in that input terminal 
50 and second switching circuit 7100 are added and the 
connections are changed. The speech signal decoding 
apparatus shown in Fig. 9 also forms a pair with the 
conventional speech signal coding apparatus shown in Fig. 2 
to constitute a speech signal coding and decoding system, 
and is configured to receive coded data outputted from the 
speech signal coding apparatus shown in Fig. 2 to perform 
decoding the coded data. In Fig. 9, the functional blocks 
identical to those in Fig. 7 are designated the same 
reference numerals as those in Fig. 7. 

A switching control signal is supplied from input 
terminal 50. Second switching circuit 7110 receives an 
excitation vector outputted from adder 1050, and outputs 
the excitation vector to synthesizing filter 1040 or to 
excitation signal normalizing circuit 2510 in accordance 
with the switching control signal. Therefore, the speech 
signal decoding apparatus can select whether the amplitude 
of the excitation vector is changed or not in accordance 
with the switching control signal, and if the amplitude of 
the excitation vector is to be changed, smoothing 
processing can be switched in accordance with the 
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characteristic of the input signal. 

While preferred embodiments of the present invention 
have been described using specific terms, such description 
is for illustrative purposes only, and it is to be 
understood that changes and variations may be made without 
departing from the spirit or scope of the following claims. 
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What Is Claimed Is: 

1. A method of decoding a speech signal by decoding 
information on an excitation signal and information on a 
linear prediction coefficient from a received signal, 
producing said excitation signal and said linear prediction 
5 coefficient from said decoded information, and driving a 
filter configured with said linear prediction coefficient 
by said excitation signal, said method comprising the steps 
of: 

calculating a norm of said excitation signal for each 
10 fixed period; 

smoothing said calculated norm using a norm obtained 
in a previous period; 

changing amplitude of said excitation signal in said 
period using said calculated norm and said smoothed norm; 
15 and 

driving said filter by said excitation signal with 
the changed amplitude. 

2. The method of decoding a speech signal according 
to claim 1, wherein said excitation signal is an excitation 
vector . 

3. The method of decoding a speech signal according 
to claim 1, wherein the amplitude of said excitation signal 
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is changed by dividing said excitation signal in said 
period by said norm, and multiplying said excitation signal 
by said smoothed norm in said period. 

4. The method of decoding a speech signal according 
to claim 3, wherein said excitation signal with the changed 
amplitude is switched to and from the excitation signal 
with an unchanged amplitude in accordance with an inputted 
switching signal, and said filter is driven by the switched 
excitation signal. 

5. The method of decoding a speech signal according 
to claim 1, wherein said received signal is a signal coded 
by representing a input speech signal with an excitation 
signal and a linear prediction coefficient. 

6. The method of decoding a speech signal according 
to claim 1, further comprising the step of discriminating 
between a sound present period and a noise period for said 
received signal using said decoded information, and wherein 
the said calculating step, said smoothing step, said 
changing step and said driving step are performed in said 
noise period. 

7. The method of decoding a speech signal according 
to claim 6, wherein said excitation signal is an excitation 
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vector, 

8. The method of decoding a speech signal according 
to claim 6, wherein the amplitude of said excitation signal 
is changed by dividing said excitation signal in said 
period by said norm, and multiplying said excitation signal 
by said smoothed norm in said period. 

9. The method of decoding a speech signal according 
to claim 6, wherein nature of said received signal in said 
noise period is identified based on said decoded 
information, and processing contents at the said smoothing 
step are selected based on said identified nature. 

10. The method of decoding a speech signal according 
to claim 8, wherein said excitation signal with the changed 
amplitude is switched to and from the excitation signal 
with an unchanged amplitude in accordance with an inputted 
switching signal, and said filter is driven by the switched 
excitation signal . 

11. The method of decoding a speech signal according 
to claim 6, wherein said received signal is a signal coded 
by representing a input speech signal with an excitation 
signal and a linear prediction coefficient. 
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12. An apparatus for decoding a speech signal by 
decoding information on an excitation signal and 
information on a linear prediction coefficient from a 
received signal, producing said excitation signal and said 
linear prediction coefficient from said decoded information, 
and driving a filter configured with said linear prediction 
coefficient by said excitation signal, said apparatus 
comprising: 

an excitation signal normalizing circuit for 
calculating a norm of said excitation signal for each fixed 
period and dividing said excitation signal by said norm; 

a smoothing circuit for. smoothing said norm using a 
norm obtained in a previous period; and 

an excitation signal restoring circuit for 
multiplying said excitation signal by said smoothed norm to 
change amplitude of said excitation signal in said period. 

13, The apparatus of decoding a speech signal 
according to claim 12, wherein said excitation signal is an 
excitation vector. 

14. The apparatus of decoding a speech signal 
according to claim 12, further comprising a sound 
present/absent discriminating circuit for discriminating 
between a sound present period and a noise period for said 
received signal using said decoded information, and wherein 
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the amplitude of said excitation signal is changed in said 
noise period. 

15. The apparatus of decoding a speech signal 
according to claim 14, further comprising a noise 
classifying circuit for identifying nature of said received 
signal in said noise period using said decoded information, 
and wherein said smoothing circuit includes a plurality of 
smoothing filters with characteristics different from one 
another, and one of said smoothing filters is selected in 
accordance with said identified nature. 

16. The apparatus of decoding a speech signal 
according to claim 15, wherein said excitation signal is an 
excitation vector. 

17 . The apparatus of decoding a speech signal 
according to claim 12, further comprising a switching 
circuit for providing said excitation signal produced from 
said decoded information to one of said excitation signal 
normalizing circuit and said filter in accordance with an 
inputted switching signal. 

18. The apparatus of decoding a speech signal 
according to claim 12, wherein said received signal is a 
signal coded by representing a input speech signal with an 
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excitation signal and a linear prediction coefficient. 

19. The apparatus of decoding a speech signal 
according to claim 15, wherein said received signal is a 
signal coded by representing a input speech signal with an 
excitation signal and a linear prediction coefficient. 
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